

AD-757  106 


SEQUENTIAL  ESTIMATION  OF  THE  LARGEST 
NORMAL  MEAN  WHEN  THE  VARIANCE  IS  UNKNOWN 


Saul  Blumenthal 


Cornell  University 


Prepared 


for: 


Office  of  Naval  Research 
National  Science  Foundation 


January  1973 


DISTRIBUTED  BY: 


National  Technical  Information  Service 
U.  S.  DEPARTMENT  OF  COMMERCE 

5285  Port  Royal  Road,  Springfield  Va.  22151 




DEPARTMENT 

OF 

OPERATIONS  RESEARCH 




Reproduced by

NATIONAL TECHNICAL
INFORMATION SERVICE

U S Department of Commerce

Springfield VA 22151




COLLEGE  OF  ENGINEERING 

CORNELL  UNIVERSITY 

ITHACA,  NEW  YORK  14850 




Approved for public release;
Distribution Unlimited




DEPARTMENT  OF  OPERATIONS  RESEARCH 
COLLEGE  OF  ENGINEERING 
CORNELL  UNIVERSITY 
ITHACA,  NEW  YORK 


TECHNICAL  REPORT  NO.  164 


January  1973 


SEQUENTIAL  ESTIMATION  OF  THE  LARGEST 
NORMAL  MEAN  WHEN  THE  VARIANCE  IS  UNKNOWN 


by 

Saul  Blumenthal* 


*Cornell  University  and  New  York  University 




Supported  in  part  by  Office  of  Naval  Research  Contract  ONR  N00014-67-A- 
0077-0020  at  the  College  of  Engineering,  Cornell  University,  and  by 
National  Science  Foundation  Grant  GK  14073  at  the  School  of  Engineering 
and  Science,  New  York  University. 


Reproduction  in  Whole  or  in  Part  is  Permitted  for 


Security Classification

DOCUMENT CONTROL DATA - R & D

(Security classification of title, body of abstract and indexing annotation must be entered when the overall report is classified)

1. ORIGINATING ACTIVITY (Corporate author)

Cornell University
Operations Research Department
Ithaca, New York 14850

2a. REPORT SECURITY CLASSIFICATION

UNCLASSIFIED

2b. GROUP

3. REPORT TITLE

SEQUENTIAL ESTIMATION OF THE LARGEST NORMAL MEAN WHEN THE VARIANCE IS UNKNOWN

4. DESCRIPTIVE NOTES (Type of report and inclusive dates)

Technical Report

5. AUTHOR(S) (First name, middle initial, last name)

Saul Blumenthal

8a. CONTRACT OR GRANT NO.

N00014-67-A-0077-0020

b. PROJECT NO.

7a. TOTAL NO. OF PAGES    7b. NO. OF REFS

10

9a. ORIGINATOR'S REPORT NUMBER(S)

Technical Report No. 164

9b. OTHER REPORT NO(S) (Any other numbers that may be assigned this report)

10. DISTRIBUTION STATEMENT

This document has been approved for public release and sale; its distribution
is unlimited.

12. SPONSORING MILITARY ACTIVITY

Mathematics Program
Office of Naval Research

13. ABSTRACT


Given n observations from each of k populations whose distributions differ
by a location parameter, the value of the largest parameter is to be estimated using
the largest value of the k sample means. It is desired to design a sampling rule
which guarantees that the Mean Squared Error (M.S.E.) of the estimate does not exceed
a given bound when the distributions have a common but unknown scale parameter.

A sequential sampling scheme is devised based on an estimate of the scale parameter
and a "least favorable" configuration of the location parameters. The sample size
characteristics of the sampling plan are studied under mild restrictions on the
distributions involved. The M.S.E. of the resulting estimator is studied under the additional
assumption of normality. A brief discussion is given of an alternate sequential plan
which uses information in the sample regarding the configuration of the location
parameters.

1.  INTRODUCTION  AND  SUMMARY

Given observations from k populations whose distributions are the same
except for the value of a location parameter, it is desired to estimate the
value of the largest of the k location parameters. A natural estimator for
this problem is the largest of the k sample means. Measuring the performance
of the estimator by its mean squared error (M.S.E.), it is possible to
choose a sample size so that the M.S.E. is bounded by a given constant regardless
of the values of the location parameters. If the common distribution
function involves an unknown scale parameter, however, then no such choice of
sample size is possible, but a sequential rule of the type studied by Chow
and Robbins [4] can be constructed. Its limiting properties will be studied
here.

Point estimation of the largest parameter for known variances has been
studied before by Blumenthal and Cohen [2], [3] and Dudewicz [5], and interval
estimation by Saxena and Tong [7], and Dudewicz and Tong [6]. Interval estimation
with unknown variance has been examined by Tong [9], [10]. The use
of the largest observed mean has no optimality properties and in fact for
k = 2 is known to be inadmissible [2] for M.S.E. However, since the analytic
form of the competing estimators is rather involved, it is suspected that this
natural estimator would be widely used and its properties would therefore be
of interest. In the next section, the sample size for the sequential stopping
rule is shown to behave well under mild restrictions on the distributions.
The behavior of the M.S.E. of the estimator formed from the sequential stopping
rule is studied only for normal distributions.

The risk function depends on the differences between the largest parameter
and the other parameters. In Section 2 we derive a conservative procedure
based on the maximum risk as a function of these differences. In [1], a
sequential procedure has been studied which attempts to capitalize on the
information in the sample about these differences, when the variance is known.
In Section 3, this procedure is extended to the case of unknown variance, and
a few of its properties are outlined.


2.  CONSERVATIVE  PROCEDURES 

Let $X_i$ ($1 \le i \le k$) be an observation from the $i$th population having
c.d.f. $F(x,\theta_i)$ ($\theta_i$ a real number). We assume that the family of c.d.f.'s
$F(x,\theta)$ satisfies

(2.1)
    (i)   $F(x,\theta) = F(x-\theta)$, where $F(x)$ is a c.d.f.
    (ii)  $F(x) = 1 - F(-x)$ or $f(x) = f(-x)$ for all $x$.
    (iii) The family of density functions $\{f(x-\theta)\}$ has monotone likelihood ratio.
    (iv)  $\int_{-\infty}^{\infty} x f(x)\,dx = 0$.

Define

(2.2)  $X^* = \max(X_1,\ldots,X_k); \quad \theta^* = \max(\theta_1,\ldots,\theta_k)$.

The ordered $\theta$'s will be denoted $\theta_{[1]} \le \cdots \le \theta_{[k]}$. The M.S.E. or
risk function of $X^*$ is easily seen to be

(2.3)  $R(X^*; \theta_1,\ldots,\theta_k) = E(X^*-\theta^*)^2 = \int_{-\infty}^{\infty} (x-\theta^*)^2 \, d\prod_{i=1}^{k} F(x-\theta_i)$.

To choose an appropriate sample size, this risk must be studied in further
detail, which is done below.


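Lemma 1. Let $F(x,\theta)$ satisfy condition (2.1). Then

(2.4)  $\sup_{(\theta_1,\ldots,\theta_k)} R(X^*; \theta_1,\ldots,\theta_k) = R(X^*; \theta^*,\theta^*,\ldots,\theta^*) = \int_{-\infty}^{\infty} x^2 \, dF^k(x)$

and

(2.5)  $\lim_{\theta_{[k]}-\theta_{[k-1]} \to \infty} R(X^*; \theta_1,\ldots,\theta_k) = \int_{-\infty}^{\infty} x^2 \, dF(x)$.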


Proof: Write

(2.6)  $R(X^*; \theta_1,\ldots,\theta_k) = \int_{-\infty}^{\theta^*} (x-\theta^*)^2 \, d\prod_i F(x-\theta_i) + \int_{\theta^*}^{\infty} (x-\theta^*)^2 \, d\prod_i F(x-\theta_i)$
       $= \int_0^{\infty} x^2 \, d\bigl[\prod_i F(\Delta_i + x) - \prod_i F(\Delta_i - x)\bigr]$

where $\Delta_i = \theta^* - \theta_i \ge 0$. Since $x^2$ is monotone increasing over $(0,\infty)$, the
expectation will be dominated by

       $\int_0^{\infty} x^2 \, dH(x)$

if $H(x) \le [\prod_i F(\Delta_i + x) - \prod_i F(\Delta_i - x)]$ (see, for example, Lemma 4.1.2 of Dudewicz
[5], or integrate by parts). That $H(x)$ can be taken as $[F^k(x) - F^k(-x)]$
is a consequence of Theorem 1 of Saxena and Tong [7]. Equation (2.5) follows
by letting all $\Delta_i \to \infty$ (except one which is fixed at zero).

Note that for $k > 2$, the risk when all the $\theta$'s are equal, (2.4),
exceeds the risk when the difference between $\theta^*$ and the others becomes
infinite, (2.5). (See Lemma 4.2.3 of [5].) Regardless of $k$, the latter
risk is just the variance of a single $X_i$ as would be expected. In the case
$k = 2$, the two risks (2.4) and (2.5) are equal as was shown by Blumenthal and
Cohen [2] without the assumption of monotone likelihood ratio.
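The comparison of (2.4) and (2.5) can be checked numerically in the standard normal case. The sketch below is only an illustration, assuming $F = \Phi$; the function names and the Simpson's-rule grid are choices made here, not part of the report. It evaluates $\int x^2\, d\Phi^k(x)$, which should equal 1 for $k = 1$ and $k = 2$ and exceed 1 for $k > 2$.

```python
import math

def normal_cdf(x):
    """Standard normal c.d.f. Phi(x)."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def normal_pdf(x):
    """Standard normal density phi(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def second_moment_of_max(k, lo=-10.0, hi=10.0, steps=4000):
    """Simpson's rule for  int x^2 dPhi^k(x) = int x^2 k Phi(x)^(k-1) phi(x) dx.

    The range [lo, hi] and the (even) number of steps are illustrative choices.
    """
    def integrand(x):
        return x * x * k * normal_cdf(x) ** (k - 1) * normal_pdf(x)

    h = (hi - lo) / steps
    total = integrand(lo) + integrand(hi)
    for j in range(1, steps):
        total += (4 if j % 2 else 2) * integrand(lo + j * h)
    return total * h / 3.0

if __name__ == "__main__":
    for k in (1, 2, 3, 4, 5):
        print(k, round(second_moment_of_max(k), 4))
```

For $k = 2$ the value 1 agrees with the equality of (2.4) and (2.5) noted above; for larger $k$ the integral grows, which is what makes the equal-parameter configuration the conservative one.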

Suppose that each $X_i$ is in fact $\bar X_{i,n}$ and is the average of $n$ independent
identically distributed observations, say $Y_{ij}$ ($1 \le j \le n$), so that
$F(x)$ is really $F_n(x)$. A conservative procedure which will guarantee that
the risk of estimating $\theta^*$ by $X^*_n$ is no greater than $r$ (given) for any
parameter configuration is to choose $n$ as the smallest integer such that

       $\int_{-\infty}^{\infty} x^2 \, dF_n^k(x) \le r$.

Suppose in addition that for each $n$, $F_n(x)$ is of the form $G_n(x\sqrt{n}/\sigma)$
where $\sigma^2$ is the variance of a single observation. Assume that in addition
to (2.1),

(2.7)  $\int_{-\infty}^{\infty} x^2 g(x)\,dx = 1$.

Then the risk function is

(2.8)  $R(X^*_n; \theta_1,\ldots,\theta_k) = (\sigma^2/n)\int y^2 \, d\prod_i G_n(y + (\Delta_i\sqrt{n}/\sigma)) \le (\sigma^2/n)\int y^2 \, dG_n^k(y) = A_n\sigma^2/n$.

The conservative choice of $n$ will then be the smallest $n \ge (A_n\sigma^2/r)$.

If the value of $\sigma^2$ is not known, a reasonable way to proceed is to form
an estimate of $\sigma^2$ by defining

(2.9a)  $S_{i,n}^2 = (1/(n-1)) \sum_{j=1}^{n} (Y_{ij} - \bar X_{i,n})^2, \quad 1 \le i \le k$

and

(2.9b)  $S_n^2 = (1/k) \sum_{i=1}^{k} S_{i,n}^2$.

Clearly $S_n^2$ is a consistent estimator of $\sigma^2$, and a sequential stopping
rule can be formulated as follows: Let $N$ be the first $n$ such that

(2.10)  $n \ge (A_n S_n^2/r)$.

(If desired, $n$ may be restricted to exceed a given value $m$, and $(\frac{1}{n} + S_n^2)$
should be substituted for $S_n^2$ if $G_n(x)$ is discrete.) Then $\theta^*$ will be
estimated by $X^*_N = \max_{1 \le i \le k} \bar X_{i,N}$.

The following theorem giving the behavior of $N$ is a restatement of
the main Theorem of Chow and Robbins [4] which applies here directly. (In
[4], $S_n^2$ is defined by (2.9b) with $k = 1$. It is easy to check that their
proofs are not affected by having $k > 1$.)
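The fixed-sample choice $n \ge A_n\sigma^2/r$ and the sequential rule (2.10) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the report's own code: the constant `a_star` is taken as given and constant in $n$ (as in the normal case), `draw(i)` is a hypothetical callback returning one new observation from population `i`, and the starting size `m` must be at least 2 so that the sample variances exist.

```python
import math
import random

def fixed_sample_size(a_star, sigma2, r):
    """Smallest integer n with n >= A* sigma^2 / r (variance known)."""
    return math.ceil(a_star * sigma2 / r)

def pooled_variance(samples):
    """S_n^2 of (2.9b): the average of the k sample variances of (2.9a)."""
    total = 0.0
    for ys in samples:
        n = len(ys)
        mean = sum(ys) / n
        total += sum((y - mean) ** 2 for y in ys) / (n - 1)
    return total / len(samples)

def sequential_estimate(draw, k, a_star, r, m=2):
    """Add one observation per population until n >= A* S_n^2 / r, as in (2.10).

    Returns (N, largest sample mean at stopping).
    """
    samples = [[draw(i) for _ in range(m)] for i in range(k)]
    n = m
    while n < a_star * pooled_variance(samples) / r:
        for i in range(k):
            samples[i].append(draw(i))
        n += 1
    return n, max(sum(ys) / n for ys in samples)

if __name__ == "__main__":
    rng = random.Random(1973)           # arbitrary seed for reproducibility
    thetas = [0.0, 0.5, 1.0]            # illustrative location parameters
    n_stop, estimate = sequential_estimate(
        lambda i: rng.gauss(thetas[i], 2.0), k=3, a_star=1.2757, r=0.05, m=5)
    print(n_stop, estimate, fixed_sample_size(1.2757, 4.0, 0.05))
```

Recomputing the pooled variance from scratch at every step keeps the sketch short; running sums would be the natural choice for long runs.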

2 

Theorem 1. Suppose $\lim_{n\to\infty} A_n = A < \infty$. Let $\lambda = (\sigma^2/r)$. Then

(2.11a)  $P_{\mu,\sigma^2}[N < \infty] = 1$ for every $\mu$ and $\sigma^2$

(2.11b)  $\lim_{\lambda\to\infty} N = \infty$, a.s.

(2.11c)  $\lim_{\lambda\to\infty} (N/A\lambda) = 1$ a.s.

(2.11d)  $\lim_{\lambda\to\infty} (E(N)/A\lambda) = 1$.

Note: Since $\lim_{n\to\infty} G_n(x) = \Phi(x)$, generally (though not necessarily always),

(2.11e)  $A = \int_{-\infty}^{\infty} x^2 \, d\Phi^k(x) = A^*$.
For general distributions, it is not possible to obtain much information
about the risk function, $E(X^*_N - \theta^*)^2$. Since it is not true that $X^*_n$ forms
a martingale, theorems relating to the moments of randomly stopped martingales
cannot be applied. A lemma of Tong [10] shows that $X^*_N$ converges stochastically
to $\theta^*$ as $r \to 0$, but little else can be said in the general case.

Hereafter, assume that all distributions are normal. In this case the
distribution of $\sqrt{N}(X^*_N - \theta^*)/\sigma$ is independent of $N$. In particular

(2.12)  $E\{(X^*_N - \theta^*)^2 \mid N\} = (\sigma^2/N)\int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{N}/\sigma)) \le (1/N)A^*\sigma^2$.

The risk of $X^*_N$ is thus related to $E(N^{-1})$. From (2.12), we see that
$(1/r)E\{(X^*_N - \theta^*)^2 \mid N\} \le (1/N)A^*\lambda$. Thus from (2.11c) we can conclude that

(2.13)  $\lim_{\lambda\to\infty} (1/r)E\{(X^*_N - \theta^*)^2 \mid N\} \le 1$ a.s.

X-xo 


This limit is conservative as can be seen by the following argument. Consider

       $\int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{n}/\sigma))$.

Applying the bounded convergence theorem to this integral we see that
(recalling that $\Delta_i = 0$ for exactly one $i$ if $\theta_{[k]}$ is unique)

(2.14)  $\lim_{n\to\infty} \int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{n}/\sigma)) = \int y^2 \, d\Phi(y) = 1$.

By (2.11b), it is seen that (2.14) implies that

(2.15)  $\lim_{r\to 0} \int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{N}/\sigma)) = 1$, a.s.

Thus, using (2.15) and (2.11c), we conclude from (2.12) that

(2.16)  $\lim_{r\to 0} (1/r)E\{(X^*_N - \theta^*)^2 \mid N\} = (1/A^*)$ a.s. (when $\theta_{[k]}$ is unique).


When $\theta_{[k]}$ is not unique, the above can be modified, as follows. Suppose
$\theta^* = \theta_{[k]} = \theta_{[k-1]} = \cdots = \theta_{[k-s+1]}$. Then

(2.17)  $\lim_{r\to 0} (1/r)E\{(X^*_N - \theta^*)^2 \mid N\} = (1/A^*)\int y^2 \, d\Phi^s(y)$, a.s.

By Corollary 4.2.4 [5], the right sides of (2.16) and (2.17) are strictly
less than one when $k > 2$, $s < k$, and are equal to one for $k = 2$, or $s = k$.
Note that (2.17) depends only on the multiplicity of $\theta^*$ and otherwise is true
for any parameter configuration.

If on the other hand, we regard $r$ as fixed and examine large $\sigma$ values,
from (2.11c), it is seen that

(2.18)  $\lim_{\sigma\to\infty} \int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{N}/\sigma)) = \int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{A^*/r}))$, a.s.,

and therefore using (2.11c) again in (2.12)

(2.19)  $\lim_{\sigma\to\infty} (1/r)E\{(X^*_N - \theta^*)^2 \mid N\} = (1/A^*)\int y^2 \, d\prod_i \Phi(y + (\Delta_i\sqrt{A^*/r})) \le 1$, a.s.

Note that different limits are obtained for the conditional risk in (2.16)
and (2.19) as $\lambda \to \infty$ depending on whether $\sigma \to \infty$ for fixed $r$ or $r \to 0$
for fixed $\sigma$.
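To see how the limit in (2.19) varies with the spacing of the means, the integral $\int y^2\, d\prod_i \Phi(y + x_i)$ can be evaluated numerically. The sketch below is illustrative only; the function names, integration range, and grid size are assumptions made here. The list of shifts must contain one entry per population, including the 0 belonging to the largest mean.

```python
import math

def normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def normal_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def risk_integral(shifts, lo=-10.0, hi=10.0, steps=4000):
    """Simpson's rule for  int y^2 d prod_i Phi(y + shifts[i])."""
    def density(y):
        # derivative of the product c.d.f. by the product rule
        cdfs = [normal_cdf(y + s) for s in shifts]
        total = 0.0
        for i, s in enumerate(shifts):
            others = 1.0
            for j, c in enumerate(cdfs):
                if j != i:
                    others *= c
            total += normal_pdf(y + s) * others
        return total

    h = (hi - lo) / steps
    acc = lo * lo * density(lo) + hi * hi * density(hi)
    for j in range(1, steps):
        y = lo + j * h
        acc += (4 if j % 2 else 2) * y * y * density(y)
    return acc * h / 3.0

def limiting_risk_ratio(deltas, r):
    """Right side of (2.19): (1/A*) int y^2 d prod Phi(y + Delta_i sqrt(A*/r))."""
    a_star = risk_integral([0.0] * len(deltas))
    scale = math.sqrt(a_star / r)
    return risk_integral([d * scale for d in deltas]) / a_star

if __name__ == "__main__":
    for r in (1.0, 0.1, 0.01):
        print(r, round(limiting_risk_ratio([0.0, 0.5, 1.0], r), 4))
```

With all differences zero the ratio is 1, and as $r$ decreases with positive differences it approaches $1/A^*$, in line with (2.16).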

Next, the unconditional risk will be investigated, namely
$(1/r)E\,E\{(X^*_N - \theta^*)^2 \mid N\}$. From (2.12) it is seen that this risk is related to
$E(N^{-1})$. The following represents a slight generalization of Theorem 3 of
Starr [8].

Theorem 2. Let the minimum sample size in (2.10) be denoted as $m$. Let
$A_n$ in (2.10) be given by $A^*$ (see (2.11e)) for each $n$. Let $\omega > 0$ be
given. Then

(2.20)  $\lim_{\lambda\to\infty} (A^*\lambda)^{\omega} E(N^{-\omega})$
            $= 1$                                        for $m > 1 + (2\omega/k)$
            $= 1 + (\omega^{\omega-1}/\Gamma(\omega))$   for $m = 1 + (2\omega/k)$
            $= \infty$                                   for $m < 1 + (2\omega/k)$.

The proof is outlined in the Appendix.
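The three regimes of (2.20) are easy to tabulate. The helper below is a small sketch, assuming the boundary constant is $1 + \omega^{\omega-1}/\Gamma(\omega)$ as obtained from (A.23), (A.24) and (A.5) of the Appendix; the function name is hypothetical.

```python
import math

def limit_of_scaled_inverse_moment(m, k, omega):
    """Limit of (A* lambda)^omega E(N^-omega) as lambda -> infinity, per (2.20).

    m: minimum sample size, k: number of populations, omega > 0.
    The boundary-case constant is an assumption read from the Appendix.
    """
    threshold = 1.0 + 2.0 * omega / k
    if math.isclose(m, threshold):
        return 1.0 + omega ** (omega - 1.0) / math.gamma(omega)
    return 1.0 if m > threshold else math.inf

if __name__ == "__main__":
    # omega = 1 is the case needed for the unconditional risk
    for k in (1, 2, 3):
        print(k, [limit_of_scaled_inverse_moment(m, k, 1.0) for m in (2, 3, 4)])
```

For $\omega = 1$ the threshold is $1 + 2/k$, so with $k \ge 3$ populations any starting size $m \ge 2$ already falls in the first regime.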

The following lemma is needed to obtain the limit of the unconditional
risk from (2.20) and either (2.15) or (2.18).

Lemma 2. Let $M$ be a positive random variable depending on a parameter $t$
with the properties

(2.21a)  $\lim_{t\to\infty} M = 1$ a.s.

(2.21b)  $\lim_{t\to\infty} EM = 1$.

Let $G(x)$ be a bounded function satisfying either

(2.22a)  $\lim_{x\to 1} G(x) = G^*$

or

(2.22b)  $\lim_{x\to\infty} G(x) = G^*$.

Then in case (2.22a)

(2.23a)  $\lim_{t\to\infty} E(MG(M)) = G^*$

or in case (2.22b)

(2.23b)  $\lim_{t\to\infty} E(MG(t/M)) = G^*$.

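Proof: Consider (2.23a). We shall show that

(2.24)  $\lim_{t\to\infty} E[M(G(M) - G^*)] = 0$.

By (2.22a), given $\varepsilon > 0$, there is a $\delta = \delta(\varepsilon)$ [$\delta(\varepsilon) \to 0$ as $\varepsilon \to 0$] s.t.

       $|G(M) - G^*| < \varepsilon$ for $1-\delta < M < 1+\delta$,

and

(2.25)  $\left|\int_{1-\delta}^{1+\delta} M(G(M) - G^*)\right| \le (1+\delta)\varepsilon$.

By the boundedness of $G(x)$, $|G(x) - G^*| < B$ (say) and we obtain

(2.26)  $\left|\int_{0}^{1-\delta} M(G(M) - G^*)\right| \le B\,P(M < 1-\delta)$

as well as

(2.27)  $\left|\int_{1+\delta}^{\infty} M(G(M) - G^*)\right| \le B \int_{1+\delta}^{\infty} M$.

By (2.21a), for $\varepsilon$ (and $\delta$) fixed there is a $t_1(\varepsilon)$ s.t.

(2.28a)  $P(M < 1-\delta) < \varepsilon$ for $t > t_1(\varepsilon)$

(2.28b)  $P[M > 1+\delta] < \varepsilon$ for $t > t_1(\varepsilon)$.

By (2.21b), for $\varepsilon$ fixed there is a $t_2(\varepsilon)$ s.t.

(2.29)  $\int_{0}^{\infty} M < 1+\varepsilon$ for $t > t_2(\varepsilon)$.

Thus for $t > t_3(\varepsilon) = \max(t_1(\varepsilon), t_2(\varepsilon))$,

(2.30)  $\int_{1+\delta}^{\infty} M < 1+\varepsilon - \int_{1-\delta}^{1+\delta} M - \int_{0}^{1-\delta} M < 1+\varepsilon - (1-\delta)(1-2\varepsilon) = 3\varepsilon + \delta - 2\varepsilon\delta$.

Combining (2.25), (2.26), (2.28a), (2.27) and (2.30)

(2.31)  $\left|\int_{0}^{\infty} M(G(M) - G^*)\right| < B[4\varepsilon + \delta - 2\varepsilon\delta] + \varepsilon(1+\delta)$ whenever $t > t_3(\varepsilon)$,

thus proving (2.23a).

To prove (2.23b), note from (2.22b) that there is a $t_4(\varepsilon)$ such that
(for $\varepsilon$ given)

       $|G(t/M) - G^*| < \varepsilon$ if $M < 1+\delta$ and $t > t_4(\varepsilon)$,

so that

(2.32)  $\left|\int_{0}^{1+\delta} M(G(t/M) - G^*)\right| < \varepsilon(1+\delta)$ for $t > t_4(\varepsilon)$.

Also, as before

(2.33)  $\left|\int_{1+\delta}^{\infty} M(G(t/M) - G^*)\right| \le B \int_{1+\delta}^{\infty} M$.

Using (2.33) and (2.32) with (2.30) gives the desired result and completes the
proof. As a consequence of the above we have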


Corollary 1: Let $m$ in (2.10) exceed $1 + (2/k)$. Then

(2.34)  $\lim_{\sigma\to\infty} (1/r)E(X^*_N - \theta^*)^2 = (1/A^*)\int_{-\infty}^{\infty} x^2 \, d\prod_i \Phi(x + \Delta_i\sqrt{A^*/r}) \le 1$

(2.35)  $\lim_{r\to 0} (1/r)E(X^*_N - \theta^*)^2 = (1/A^*) \le 1$.

Proof: Let $M = (A^*\sigma^2/rN)$ and apply (2.11c), (2.11d), and (2.20). For
(2.34), let $t = \sigma$ and use (2.12), writing $(\Delta_i\sqrt{N}/\sigma)$ as $(\Delta_i\sqrt{A^*/rM})$ and
use (2.18), with (2.23a). For (2.35), let $t = (1/r)$ and use (2.12), writing
$(\Delta_i\sqrt{N}/\sigma)$ as $(\Delta_i\sqrt{tA^*/M})$, and use (2.14) with (2.23b).
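The change of variable used in this proof can be written out explicitly. The lines below are only a worked restatement, assuming the definition $M = A^*\sigma^2/(rN)$ given in the proof; they add no new result.

```latex
M = \frac{A^*\sigma^2}{rN}
\;\Longrightarrow\;
N = \frac{A^*\sigma^2}{rM},
\qquad
\frac{\Delta_i\sqrt{N}}{\sigma} = \Delta_i\sqrt{\frac{A^*}{rM}} .
% For fixed r and M -> 1 the shift tends to \Delta_i \sqrt{A^*/r}   (case (2.22a), t = \sigma).
% With t = 1/r the shift is \Delta_i \sqrt{t A^*/M}, which is unbounded
% as t -> \infty whenever \Delta_i > 0                              (case (2.22b)).
\frac{1}{r}\,E\{(X^*_N-\theta^*)^2 \mid N\}
  = \frac{M}{A^*}\int y^2\, d\prod_i \Phi\!\Bigl(y + \Delta_i\sqrt{\tfrac{A^*}{rM}}\Bigr).
```

The last line shows why Lemma 2 is stated for expectations of the form $E(MG(\cdot))$: the conditional risk in (2.12), divided by $r$, is $M$ times a bounded function of $M$.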

3.  ELIMINATING  ALL  NUISANCE  PARAMETERS 

For normal distributions, a natural sequential procedure which attempts
to use all of the sample information regarding nuisance parameters would be
a combination of the rule studied in Section 2 with the rule considered in
Blumenthal [1] for the case of known $\sigma^2$. In particular, such a procedure
would use the stopping rule: stop for the first $n$ such that

(3.1)  $r \ge (S_n^2/n) \int_{-\infty}^{\infty} y^2 \, d\prod_i \Phi[y + (\hat\Delta_{i,n}\sqrt{n}/S_n)]$

where $S_n^2$ is given by (2.9) and $\hat\Delta_{i,n}$ is a consistent estimate of $\Delta_i$, such
as $(X^*_n - \bar X_{i,n})$. When $\sigma^2$ is not known, it is not possible to construct a
two sample procedure whose sample size will be very nearly the same as that
of the sequential procedure, as was done in [1]. Only the sequential procedure
will be studied here.
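A direct check of the stopping condition (3.1) at a given stage can be sketched as follows. This is an illustration under assumptions made here: the differences are estimated by the largest sample mean minus each sample mean, the integral is approximated by a midpoint Riemann-Stieltjes sum on an arbitrary grid, and all function names are hypothetical.

```python
import math

def product_cdf(y, shifts):
    """prod_i Phi(y + shifts[i]) for standard normal Phi."""
    p = 1.0
    for s in shifts:
        p *= 0.5 * (1.0 + math.erf((y + s) / math.sqrt(2.0)))
    return p

def stieltjes_second_moment(shifts, lo=-10.0, hi=10.0, steps=4000):
    """Midpoint Riemann-Stieltjes sum for  int y^2 d prod_i Phi(y + shifts[i])."""
    h = (hi - lo) / steps
    total = 0.0
    prev = product_cdf(lo, shifts)
    for j in range(1, steps + 1):
        y = lo + j * h
        cur = product_cdf(y, shifts)
        mid = y - 0.5 * h
        total += mid * mid * (cur - prev)
        prev = cur
    return total

def should_stop(sample_means, s2, n, r):
    """Condition (3.1) with estimated differences (largest mean minus each mean).

    s2 is the pooled variance estimate S_n^2 and must be positive.
    """
    best = max(sample_means)
    scale = math.sqrt(n / s2)
    shifts = [(best - x) * scale for x in sample_means]
    return r >= (s2 / n) * stieltjes_second_moment(shifts)

if __name__ == "__main__":
    # Equal means: the integral is about 1.2757 for k = 3, so r = 0.11 is not yet met.
    print(should_stop([1.0, 1.0, 1.0], s2=1.0, n=10, r=0.11))
    # Well separated means: the integral is close to 1, so the same r is met.
    print(should_stop([0.0, 0.0, 100.0], s2=1.0, n=10, r=0.11))
```

The second call illustrates the intended gain: when the sample indicates that the largest mean stands apart, sampling can stop earlier than under the conservative rule of Section 2.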

As in [1], an alternative stopping rule will be defined in order to
avoid difficulties which may be caused by the behavior of the function

(3.2)  $H(x_1,\ldots,x_{k-1}) = \int_{-\infty}^{\infty} y^2 \, d\prod_{i=1}^{k} \Phi(y + x_i), \quad (x_k = 0)$.

Namely, let $\eta_n$ be the solution of

(3.3)  $\eta_n^2 = H(\hat\Delta_{1,n}\eta_n/\sqrt{r}, \ldots, \hat\Delta_{k-1,n}\eta_n/\sqrt{r})$

(where we label so that $\hat\Delta_{k,n} = 0$), and let

(3.4)  $\tilde n = \lceil \eta_n^2 S_n^2 / r \rceil$

where $\lceil x \rceil$ is the smallest integer not less than $x$. Sampling is stopped
at the first $n$ such that $n \ge \tilde n$. For a discussion of the relation of this
stopping rule to that of (3.1), see [1].

Since $S_n^2$ is independent of the $\bar X_{i,n}$, hence of the $\hat\Delta_{i,n}$, the limiting
processes of Theorems 1 of this paper and [1] can be combined to give:

Theorem 3: If all $\Delta_i > 0$, $1 \le i \le k-1$, then for the stopping rule described
after (3.4), under the assumptions of Theorem 1 of this paper, and Theorem 1
of [1],

(3.5)  $\lim_{r\to 0} (N/\lambda) = \lim_{r\to 0} E(N/\lambda) = 1$.


Proof: The stopping rule is of the form (2.10), with

       $A_n = H(\hat\Delta_{1,n}\sqrt{n}/S_n, \ldots, \hat\Delta_{k-1,n}\sqrt{n}/S_n)$.

At stopping, $(\sqrt{N}/S_N) \ge \sqrt{A_N/r} \ge \sqrt{R^*/r}$, so that $P\{\hat\Delta_{i,N}\sqrt{N}/S_N > T,\ 1 \le i \le k-1\}$
can be made arbitrarily close to unity for fixed $T$ by choosing $r$ sufficiently
small. Thus $A_N$ converges with probability one to unity as $r$ decreases,
and (3.5) can be derived as though $A_N$ converged deterministically to unity.
This completes the proof.


If some $\Delta_i = 0$, or if the $\Delta_i$ are proportional to $\sqrt{r}$, or if the $\Delta_i$
are fixed and positive, but the limit is taken as $\sigma \to \infty$, then as in Theorem 2
of [1], there will be no probability one limit for $(N/\lambda)$ as $\lambda \to \infty$, but it
can be seen that $(N/\lambda)$ will have some limiting distribution whose mass is
concentrated on $(R^*, A^*)$ (in the limit, it will behave as though $\sigma$ were
known), where $R^* = \inf H(x_1,\ldots,x_{k-1})$.

APPENDIX


Proof of Theorem 2: Since the proof given by Starr [8] of his Theorem 3 can
be used with only trivial modification, our proof will be given in outline
form only.

Define

(A.1)  $n^0 = A^*\lambda, \quad V_n = k(n-1)S_n^2/\sigma^2, \quad \ell(n,\lambda) = kn(n-1)/A^*\lambda$.

The stopping rule is:

(A.2)  stop for the smallest $n \ge m$ s.t. $V_n \le \ell(n,\lambda)$,

where $V_n$ has a chi square distribution with $k(n-1)$ degrees of freedom.
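The equivalence between the rule (2.10) with $A_n = A^*$ and its chi square form can be verified line by line. The following is a worked restatement, writing $V_n$ for $k(n-1)S_n^2/\sigma^2$ and using only $\lambda = \sigma^2/r$ and the definition (2.9b) of $S_n^2$.

```latex
n \ge \frac{A^* S_n^2}{r}
\iff \frac{S_n^2}{\sigma^2} \le \frac{n\,r}{A^*\sigma^2} = \frac{n}{A^*\lambda}
\iff V_n = \frac{k(n-1)S_n^2}{\sigma^2} \le \frac{kn(n-1)}{A^*\lambda} = \ell(n,\lambda).
% Under normality each (n-1) S_{i,n}^2 / \sigma^2 is chi square with n-1 degrees of freedom,
% the k populations are independent, and k(n-1) S_n^2 = \sum_{i=1}^{k} (n-1) S_{i,n}^2,
% so V_n is chi square with k(n-1) degrees of freedom.
```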

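Let $\varepsilon > 0$ be given and define

(A.3)  $a = (1-\varepsilon)^{1/\omega} n^0, \quad b = (1+\varepsilon)^{1/\omega} n^0$,

       $\Pi_1 = m^{-\omega} P(N = m)$,
       $\Pi_2 = b^{-\omega} P(m < N \le b)$,
       $\Pi_3 = \sum_{m < n \le a} n^{-\omega} P(N = n)$,
       $\Pi_4 = a^{-\omega} P(N > a)$.

Noting that ... and using

       $E N^{-\omega} \ge \Pi_1 + \Pi_2$,

Starr's argument yields,

(A.4)  $\liminf_{\lambda\to\infty} (n^0)^{\omega} E N^{-\omega} \ge a(m,\omega) \lim_{\lambda\to\infty} \{(n^0)^{\omega} (\ell(m,\lambda))^{k(m-1)/2}\} + 1 - \delta \quad (0 < \delta = \delta(\varepsilon) < 1)$

where

(A.5)  $a(m,\omega) = \bigl(m^{\omega}\, k(m-1)\, \Gamma(k(m-1)/2)\, 2^{(k(m-1)/2) - 1}\bigr)^{-1}$.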


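The next step uses

(A.6)  $E N^{-\omega} \le \Pi_1 + \Pi_3 + \Pi_4$.

First, it is easily checked that

(A.7)  $\Pi_1 \le a(m,\omega)(\ell(m,\lambda))^{k(m-1)/2}$.

Next,

(A.8)  $\Pi_3 \le \sum_{m+1} \bigl[n^{\omega}\, \Gamma(k(n-1)/2)\, 2^{k(n-1)/2}\bigr]^{-1} \int_0^{\ell(n,\lambda)} e^{-x/2} x^{(k(n-1)/2) - 1}\,dx$
       $= \sum_{m+1} \bigl[n^{\omega}\, \Gamma(k(n-1)/2)\, 2^{k(n-1)/2}\bigr]^{-1} \int_0^{\ell(n,\lambda)} \bigl[e^{-x/2} x^{(k(n-1)+(p-1))/2}\bigr] x^{-(p+1)/2}\,dx$.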


Define

(A.9)  $h(n,\lambda) = (\ell(n,\lambda)/k(n-1)) = n/A^*\lambda = n/n^0$.

By the definition of $a$, (A.3),

(A.10)  $h(n,\lambda) \le (1-\varepsilon)^{1/\omega} = 1-\zeta \quad (0 < \zeta = \zeta(\varepsilon) < 1)$ for all $n \le a$,

so that

(A.11)  $\ell(n,\lambda) \le k(n-1)(1-\zeta) < k(n-1) + (p-1)$ for $n \ge 2$, $0 < 1-\zeta < p < 1$.

From the fact that $[x^{(k(n-1)+p-1)/2} e^{-x/2}]$ achieves its maximum at $k(n-1) + p - 1$,
and (A.11), the integral in (A.8) is bounded above by

(A.12)  $(\ell(n,\lambda))^{(k(n-1)+p-1)/2}\, e^{-\ell(n,\lambda)/2} \int_0^{\ell(n,\lambda)} x^{-(p+1)/2}\,dx$.


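Furthermore,

(A.13)  $\Gamma(k(n-1)/2) \ge [k(n-1)/2]^{(k(n-1)-2)/2}\, e^{-k(n-1)/2}$

so that the inequality (A.8) becomes

(A.14)  $\Pi_3 \le (1-p)^{-1} \sum_{m+1} \{n^{-\omega} [k(n-1)]^{(2-k(n-1))/2} e^{k(n-1)/2}\}\, e^{-\ell(n,\lambda)/2} (\ell(n,\lambda))^{k(n-1)/2}$
        $= k(1-p)^{-1} \sum_{m+1} (n^{-\omega}(n-1)) [e^{(1-h(n,\lambda))} h(n,\lambda)]^{k(n-m-1)/2} [e^{(1-h(n,\lambda))} h(n,\lambda)]^{km/2}$.

Denote

(A.15)  $\Lambda(n,\lambda) = h(n,\lambda)\, e^{(1-h(n,\lambda))}$.

From (A.9) and (A.10)

(A.16)  $(\Lambda(n,\lambda))^{km/2} \le e^{km/2} (n/A^*\lambda)^{km/2}$

(A.17)  $\Lambda(n,\lambda) \le (1-\zeta)e^{\zeta} < 1, \quad n \le a, \quad 0 < \zeta < 1$.

Thus, using (A.16) and (A.17) in (A.14),

(A.18)  $\Pi_3 \le (A^*\lambda)^{-km/2}\, e^{km/2}\, k(1-p)^{-1} \sum_{m+1} [n^{-\omega}(n-1)n^{km/2}] (\Lambda(n,\lambda))^{k(n-m-1)/2}$.

From (A.17) and the ratio rule for series convergence, we obtain

(A.19)  $\Pi_3 \le G\lambda^{-km/2}$

where $G$ is a constant independent of $\lambda$. Combining (A.6), (A.7), and (A.19)
gives

(A.20)  $(n^0)^{\omega} E N^{-\omega} \le a(m,\omega)(n^0)^{\omega}(\ell(m,\lambda))^{k(m-1)/2} + (n^0)^{\omega}\lambda^{-km/2} G + (1-\varepsilon)^{-1} P(N > a)$.

Definitions (A.1) imply that

(A.21)  $\lim_{\lambda\to\infty} (n^0)^{\omega}(\ell(m,\lambda))^{k(m-1)/2} < \infty \;\Rightarrow\; \lim_{\lambda\to\infty} [(n^0)^{\omega}\lambda^{-km/2} G] = 0$.

Using (A.20), (A.21) and (2.11c) gives

(A.22)  $\limsup_{\lambda\to\infty} (n^0)^{\omega} E N^{-\omega} \le a(m,\omega) \lim_{\lambda\to\infty} [(n^0)^{\omega}(\ell(m,\lambda))^{k(m-1)/2}] + 1 + \delta', \quad (0 < \delta' = \delta'(\varepsilon) < 1)$.

Combining (A.22) and (A.4) gives

(A.23)  $\lim_{\lambda\to\infty} (n^0)^{\omega} E N^{-\omega} = 1 + a(m,\omega) \lim_{\lambda\to\infty} [(n^0)^{\omega}(\ell(m,\lambda))^{k(m-1)/2}]$.

From (A.1), we have

(A.24)  $(n^0)^{\omega}(\ell(m,\lambda))^{k(m-1)/2} = [km(m-1)]^{k(m-1)/2} (A^*\lambda)^{\omega - (k(m-1)/2)}$.

Using (A.24) and (A.5) in (A.23) yields (2.20), completing the proof.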
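The final step can be spelled out as a short calculation. The lines below are a sketch, assuming the form $a(m,\omega) = [m^{\omega} k(m-1)\Gamma(k(m-1)/2)\,2^{k(m-1)/2-1}]^{-1}$ for the constant in (A.5); the three cases follow from the sign of the exponent of $A^*\lambda$ in (A.24).

```latex
(n^0)^{\omega}\,\ell(m,\lambda)^{k(m-1)/2}
  = [km(m-1)]^{k(m-1)/2}\,(A^*\lambda)^{\omega - k(m-1)/2}
\;\longrightarrow\;
\begin{cases}
0, & m > 1 + 2\omega/k,\\[2pt]
(2\omega m)^{\omega}, & m = 1 + 2\omega/k \quad (\text{so } k(m-1) = 2\omega),\\[2pt]
\infty, & m < 1 + 2\omega/k .
\end{cases}
% Boundary case: substitute k(m-1) = 2\omega into the assumed form of a(m,\omega).
a(m,\omega)\,(2\omega m)^{\omega}
  = \frac{2^{\omega}\,\omega^{\omega}\,m^{\omega}}
         {m^{\omega}\cdot 2\omega\cdot\Gamma(\omega)\cdot 2^{\omega-1}}
  = \frac{\omega^{\omega-1}}{\Gamma(\omega)} .
```

Inserting these limits in (A.23) gives the values $1$, $1 + \omega^{\omega-1}/\Gamma(\omega)$, and $\infty$ in the three regimes of $m$.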


